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INTRODUCTION 

Field of the Invention 

25 The field of this invention is analyte detection, particularly labels employed in 

analyte detection. 

Background of the Invention 

Analyte detection, in which a sample is assayed for the presence of a particular 

analyte of interest, is critical to a variety of fields. For example, analyte detection 
30 protocols are employed in the characterization of complex mixtures, the identification 

and characterization of novel compounds of interest, and the like, both in industry and 

academia. Analyte detection also plays a role in medicine, particularly in the diagnosis 
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of disease. In such applications, a sample from a patient is assayed for the presence of 
one or more specific analytes indicative of the disease being diagnosed. 
Environmental monitoring applications, such as the monitoring of pollutants and 
toxins, also employs analyte detection protocols. 

Where an analyte of interest is not readily measured directly, the use of a 
detectable label is required. A variety of detectable labels have been developed over 
the years for use in analyte detection. These include both directly and indirectly 
detectable labels. Directly detectable labels include radioisotopes, enzymes, 
fluorescent and chemiluminescent agents, or other labels capable of being directly 
detected. Where an indirectly detectable label is employed, it is generally a member 
of a signal producing system such as a second stage antibody or reagent that is used 
to amplify the signal. Such reagents are well known in the art. For example, the 
primary antibody may be conjugated to biotin, with horseradish peroxidase-conjugated 
avidin added as a second stage reagent. Alternatively, the secondary antibody may be 
conjugated to a fluorescent compound, e.g. fluorescein, rhodamine, Texas red, etc. 
Final detection uses a substrate that undergoes a color change in the presence of the 
peroxidase. 

Despite the wide variety of different labeling agents that have been developed 
over the years for use in analyte detection, there is still an interest in developing new 
ones. Of particular interest would be the development of sensitive labels that can 
simultaneously detect small quantities of analytes present in complex mixtures by 
methods that do not require preparation steps prior to detection. 
Relevant Literature 

U.S. Patents of interest include: 5,573,905 and 5,723,598. Also of interest are 
Akeson et al., "Microsecond time-scale discrimination between polycytidylic acid, 
polyadenylic acid and polyuridylic acid as homopolymers or as segments within single 
acid as homopolymers or as segments within RNA molecules" Biophys.J (1999) 
77:3227-3233; Wonderlin et al., "Optimizing planar lipid bilayer single-channel 
recordings for high resolution with rapid voltage steps" Biophys. J. (1990) 58:289- 
297; and Kasianowicz, et al., "Characterization of individual polynucleotide molecules 
using a membrane channel," Proc. Natl. Acad. Sci, USA (1996) 93: 13770-13773. See 
also, Benoit et al., J. Am. Chem. Soc, (1999) 121:3904; Hawker et al., Paper 136, 
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PMSE Division, 217^^ ACS Meeting, Anaheim (1999); Hawker et al., Acc. Chem. Res. 
(1997) 30:373; Dao et al., J. Polymer Science Part A-Polymer Chemistry (1998) 
36:2161-2167; Keck et al., J. Org. Chem. (1996) 61 :359. 



Targeted molecular bar codes and methods for their use in analyte detection are 
provided. The subject targeted molecular bar codes comprise a molecular bar code 
bonded to a member of a specific binding pair, usually through a linking group. The 
molecular bar codes are charged polymeric compounds, preferably negatively charged 
compounds, e.g. negatively charged block copolymers, that can be detected by a 
nanopore. The subject targeted molecular bar codes find use in a variety of different 
analyte detection applications, including screening and diagnostic applications. 

BRIEF DESCRIPTION OF THE FIGURES 
Figure 1 provides a schematic diagram of a targeted molecular bar code. 

Figure 2 provides a schematic of a nanopore device that may be used to 
examine molecular bar codes. 

Figure 3 provides an example of a current blockade profile caused by a 
molecular bar code. Specifically, Figure 3 provides an example of a current blockade 
profile caused by a molecular bar code. A single a-hemolysin channel was inserted 
into a bilayer giving an open current of 126 pA at 120 mV in 1 M KCl buffer. During 
control measurements in the absence of a bar code, virtually no blockades were 
observed. Addition of A(30)C(70)Gp bar code to the cis bath immediately produced 
blockade events at a rate of approximately 1 per second. In this experiment, most 
bilevel events first exhibited 5 pA residual current (95 percent current blockade, solid 
arrow) followed by a 19 pA residual current (84 percent blockade, dashed arrow). 

Figure 4 provides a representation of a general approach using targeted 
molecular bar codes. 
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SUMMARY OF THE INVENTION 





Figure 5 shows the current fluctuations caused by a polydT(18) bar code. 
Comparison between unimpeded current in a nanopore and current fluctuations cause 
by a polydT(18) bar code. The tope tracing (A) shows unimpeded current through a 
single a-hemolysin channel at 120 mV in 1 M KCl. The lower tracing (B) shows 
5 blockades of the same channel current by polydT 18 mers at 20 |aM concentration. At 
this concentration, the blockade frequency was about 50 per second. The duration of 
the blockades averaged about 50 microseconds. 

Figure 6 shows the nanopore current blockades characteristic of a 
polyphosphate bar code and a polyC bar code. Nanopore current blockades 

10 characteristic of a polyphosphate bar code and a polyC bar code are shown. (A) 
Typical blockades caused by a nominal 65 mer of polyphosphate. The average 
amplitude of these blockades was 30 pA relative to the unimpeded channel current. 
(B) Typical blockades caused by a polycytidylic acid 125 mer. The average amplitude 
of these blockades was 115 pA. These two examples show the ca.85 pA range of 

15 blockade amplitudes that can be built into individual bar code blocks. This is seven- 
fold greater than the 12 pA difference that has been shown to be discernible within 
single RNA strands using a commercial patch clamp amplifier and the horizontal thin 
film device. 

Figure 7. Current profile analysis demonstrating that a nanopore device can 
20 discriminate between different bar codes in a mixture, (a) is polyA; (b) is polyC and 
(c) is polyA+polyC. 

Figure 8. Current profile analysis for a bar code comprised of polydT 
oligomers linked to polyethylene glycol (CI 8) spacers. Specifically, this figure shows 
the event pattern caused by a block copolymer comprised of polydT DNA oligomers 
25 linked via phosphodiester bonds to non-DNA polyethylene glycol (CI 8) spacers. This 
molecular bar code (polydT(33)/PEG18/dT(i)/PEG18/polydT(34)) was synthesized on an 
ABI 392 synthesizer using conventional phosphoramidite chemistry. 

Figure 9 provides a schematic diagram of the protocol used to detect N-ras, as 
described in the Experimental Section, Infra. 
30 Figure 10. Current profile analysis of N-ras bar code. 
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Figure 1 1 . Comparison of blockade durations for a) polydC oligomer with an 
internal 10 bp hairpin and b) a polydC oligomer with the same nucleotide composition 
but in random order. 

Figure 12. Blockade duration as a function of number of base pairs in hairpins 
inserted into polydC oligomers. The upper line (dark diamonds) represents the 
duration mode for oligomers with base-paired hairpins. The lower line (open circles) 
represents the duration mode for oligomers with the same nucleotide composition but 
randomized. 

Figure 13. Discrimination between polydC oligomers with 10 base-pair 
hairpins and oligomers with randomized sequence based on blockade duration. Each 
population includes approximately 1000 blockade events. 

Figure 14. A model molecular bar code containing two 6 base-pair hairpins 
designed to control the rate of translocation through the nanopore. 

Figure 15. Current blockades caused by the oligonucleotide diagrammed in 
Figure 14. The arrows indicate an abrupt current increase that correlates with 
occupancy of the pore by an abasic segment built between poly dC segments. 

Figure 16. Schematic of living polymerization reaction using BraslauA^ladimir 
initiator. 

Figure 17. Representative hydrophilic monomers that may be used in the 
molecular bar codes. 

Figure 18. Representative motifs in copolymers of the subject invention. 

Figure 19. Charged or ionizable monomers that may be employed in the 
subject molecular bar codes. 

Figure 20. Derivitizable monomers that may be employed in the subject 
molecular bar codes. 

Figure 2 1 . Motifs of different branched polymers that may be employed in the 
subject bar codes. 

Figure 22. Schematic for preparation of functionalized nitroxide 
BraslauA^ladimir initiator. 

Figure 23. Schematic for preparation of secondary hydroxy group at the 
terminus of a bar code. 
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DETAILED DESCRIPTION OF THE INVENTION 
Targeted molecular bar codes and methods for their use in analyte detection are 
provided. The subject targeted molecular bar codes include a molecular bar code 
stably associated with, e.g. covalently attached, typically through a linking group, to a 
5 member of a specific binding pair ( see Figure 1). The molecular bar code is generally 
a charged polymeric compound, and in preferred embodiments, is a negatively charged 
or anionic compound, e.g. a negatively charged block copolymer. The subject targeted 
molecular bar codes find use in a variety of different analyte detection applications, 
including screening and diagnostic applications. In further describing the subject 
10 invention, the targeted molecular bar codes will be described first followed by a 
discussion of methods for their use in analyte detection. 

Before the subject invention is described fiarther, it is to be understood that the 
invention is not limited to the particular embodiments of the invention described 
below, as variations of the particular embodiments may be made and still fall within 
15 the scope of the appended claims. It is also to be understood that the terminology 

employed is for the purpose of describing particular embodiments, and is not intended 
to be limiting. Instead, the scope of the present invention will be established by the 
appended claims. 

In this specification and the appended claims, the singular forms "a," "an," and 
20 "the" include plural reference unless the context clearly dictates otherwise. Unless 
defined otherwise, all technical and scientific terms used herein have the same 
meaning as commonly understood to one of ordinary skill in the art to which this 
invention belongs. 

A critical feature of the targeted molecular bar code of the subject invention is 
25 the molecular bar code. The molecular bar code is a polymeric molecule that is 

capable of being translocated through a nanopore under the influence of an applied 
electric field (see Figure 2). Importantly, during translocation of the molecular bar 
code through the nanopore, a reproducible signal is produced, for example a current 
blockade profile ( see Figure 3). Other signals may include fluorescence, 
30 chemiluminescence, or variations in tunneling current. By current blockade profile is 
meant the collection of current blockade data points plotted versus a given period of 
time upon application of an applied electric field to a nanopore. Components of the 
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profile that may be employed include one or more of amplitude, duration and pattern, 
where usually two or more of these characteristics are detected when looking at the 
current blockade profile. The given period of time that a single bar code molecule is 
examined is generally at least about 10 microseconds, usually at least about 100 
microseconds and more usually at least about 250 microseconds, and may be as long 
as 10 milliseconds or longer, but will usually not exceed about 5 milliseconds in 
length. The current blockade data points are derived from the observed change in ionic 
current through the nanopore from the cis to the trans side upon occupancy by the 
polymeric molecular bar code. Suitable current measurement devices, as well as 
hardware and software necessary for generating the current blockade profile from the 
observed changes in current, are well known to those of skill in the art and a 
representative system is disclosed in the Experimental Section, infra. 

The polymeric molecular bar code is sufficiently long to generate a distinctive, 
reproducible signature or current blockade profile such that the molecular bar code 
provides a distinctive tag for the specific binding member with which it is associated, 
(e.g. covalently attached) in the targeted molecular bar code. As such, the length of 
the polymeric molecular bar code is at least about 10 nm, usually at least about 20 nm 
and more usually at least about 50 nm, where the length may be as long as 1000 nm or 
longer, but will usually not exceed about 500 nm and more usually will not exceed 
about 400 nm. The polymeric molecular bar code is either a linear or non linear, e.g. 
branched polymer, where non-linear polymers of interest include comb polymers, star 
polymers, and dendritic polymers. In many embodiments, the polymer is generally a 
linear molecule. The linear polymeric molecule may or may not assume a secondary 
structure under the conditions in which it is employed, e.g. physiologically acceptable 
aqueous solutions, where the secondary structure may be a helical configuration, a 
loop, e.g a hairpin loop, etc. The only limitation with respect to the physical 
dimensions of the molecular bar code is that it be able to assume a configuration that 
allows it to pass or translocate lengthwise through the nanopore from the cis side to the 
trans side. 

The polymeric molecular bar code is generally a charged molecule such that it 
moves under the influence of an applied electric field. While in the broadest sense the 
molecular bar code may be positively or negatively charged, it is generally negatively 
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charged. In other words, the molecular bar code is typically an anionic polymeric 
molecule. 

The molecular bar code may be a homopolymer or a copolymer of two or more 
different types of monomers. As indicated above, the only limitation on the molecular 
5 bar code is that it generate a distinct and reproducible signal upon translocation 

through a nanopore, as described in greater detail infra. Block copolymers of interest 
include polymers of 2 or more up to 10 or more different types of monomeric units, 
where copolymers of interest include random, gradient and block copolymers. 

In many embodiments of interest, the molecular bar code is a block copolymer 

10 in which a plurality of blocks are covalently bonded to each other in sequential 
fashion. By plurality of blocks is meant at least 2, where the block copolymer will 
generally have from 2 to 20 blocks, usually from 2 to 10 blocks and in many cases 
from 3 to 10 blocks. In the molecular bar codes, the number of unique blocks may be 
the same as the total number of blocks in the copolymer or it may be less, e.g. in 

15 molecular bar codes in which a given unique block is repeated one or more times. In 
many preferred embodiments, the total number of distinct blocks in the copolymer is 
less than the total number of blocks in the polymer. In these preferred embodiments, 
the total number of distinct blocks in any given copolymer will generally range from 1 
to 6, and usually from 2 to 4, with 3 distinct blocks being found in many molecular 

20 bar codes of particular interest. 

Each block in the polymer may be heteropolymeric or homopolymeric. As 
such, each block in the polymer may be made up of the same monomeric units or a 
plurality of different monomeric units. Where the block is a heteropolymer made up of 
a plurality of monomeric units, the number of different monomeric units is at least 2, 

25 and may be as high as 5 or 7 or more, but is generally not more than 4. In many 

embodiments, the heteropolymeric block is a polymer of 2 or 3 different monomeric 
units. 

The length of each block within a molecular bar code will be sufficient to 
generate a reproducible current blockade or other signal as the block translocates 
30 through a nanopore under an applied electric field. As such, the length of each block 
will generally range from about 1 or 5 to 50, usually from about 5 or 10 to 30 and 
more usually from about 10 or 15 to 20 or 25 nm, where in many embodiments the 
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length will be about 15 to 20 nm. Depending on the nature of the monomeric units of 
the block, the number of monomeric units in a particular block may vary, where the 
number of monomeric units in a block will range from about 1 to 100, usually from 
about 3 to 60 and more usually from about 5 to 40. 
5 In many preferred embodiments, the block(s) of the negatively charged or 

anionic polymeric bar codes are homopolymers of monomeric units that include a 
phosphate group (or phosphorothioate group), where the phosphate group (or 
phosphorothioate group) is not a pendant group but a member of the polymeric 
backbone of the bar code, i.e. the backbone of the polymer includes the presence of 

10 phosphodiester bonds. Of particular interest as monomeric units are phosphates or 
sugar phosphates. When the monomeric units are sugar phosphates, the sugar moiety 
of the monomeric unit will generally be from 4 to 6 carbon atoms, where the carbon 
atoms will generally form a cyclic structure, e.g. a pentose or hexose. Of particular 
interest are 6 carbon sugars forming a five-sided ring structure, specifically ribose and 

15 deoxyribose. The monomeric units may further include one or more pendant groups, 
such as heterocyclic nitrogenous bases, e.g. purines and pyrimidines. Specific purines 
of interest include naturally occurring and synthetic purines, where naturally occurring 
purines such as adenine and guanine, specifically adenine, are of particular interest. 
Likewise, specific pyrimidines of interest include naturally occurring and synthetic 

20 pyrimidines, where naturally occurring pyrimidines such as cytosine, thymine and 
uracil, particularly uracil and cytosine, are of particular interest. In many 
embodiments, the purines and pyrimidines are modified to minimize hydrogen 
bonding between pairs, e.g. Watson-Crick base-pairing is minimized. 

The block(s) of the molecular bar code are, in many embodiments. 



25 homopolymers of monomeric units having the structural formula: 
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wherein: 

R is H or a heterocyclic nitrogenous base, e.g. a heterocyclic nitrogenous base 
that may or may not be capable of participating in Watson-Crick base pairing 
interactions, such as a purine or a pyrimidine; 

Y is H or OH; 

n is 0 or 1 ; 

m is an integer ranging from 10 to 80; and 
q is 0 or 1 . 

Specific polymeric blocks of interest include: polycytidylic acid, polyadenylic 
acid, polyuridylic acid, polyphosphate, oligodeoxyribose phosphate (i.e. abasic 
polydeoxyribose phosphate), polyethylene glycol-phosphodiesters, oligonucleotides 
(including modified oligodeoxynucleotides such as oligo-5-nitroindole that do not 
form Watson-Crick base pairs), polystyrenes, polyacrylic acids, polyacrylates, 
polyacetonitriles, polyvinylphosphonates, polyacetamides and the like. Of particular 
interest in certain embodiments are molecular bar codes that are fabricated from 
polymers with biologically inert backbones, where such polymers include 
polystyrenes, polyacrylic acids, polyacrylates, polyacetonitriles, 
polyvinylphosphonates, polyacetamides and the like. 

In many embodiments, the polymeric compound that is the molecular bar code 
is not a deoxyribonucleic acid in which all of the bases are selected from the group 
containing adenine, cytosine, thymine, and guanine bases. For example, where one of 
the blocks of a copolymer is an oligodeoxyribonucleotide comprised of one of these 
bases, the molecular bar code will usually comprise at least one additional non- 
oligodeoxyribonucleotide, such as polyacrylic acid, polystyrene, polyacetamides, 
polyphosphate, oligodeoxyribosephosphate, etc. Accordingly, the entire molecular bar 
code will not be a deoxyribonucleic acid in these particular embodiments. 

To further modify the properties of the molecular bar code, e.g. the profile of 
the bar code generated upon translocation through a nanopore, a number of different 
types of elements can be incorporated into the bar codes. For example, secondary 
structures may be built into the bar code, as mentioned above. Of particular interest in 
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certain embodiments are hairpin structures, which may be built into the bar code to 
modulate the profile generated by the bar code, extend the translocation time of the 
barcode through the pore, etc. 

Hairpin structures form when a single polymer contains regions of 
complementary sequence separated by several non-complementary units. The 
polymer strand folds on itself so that the complementary regions associate, forming the 
stem of the hairpin. The non-complementary units connecting the complementary 
regions form a loop. Hairpin stem structures of interest vary in length from about 1 to 
24 bp, but typically range from about 2 to 1 2 and usually from about 2 to 1 0 bp in 
length. As such, the length of the polymeric stretch that gives rise to the hairpin under 
the conditions of use of the bar code typically ranges in certain embodiments from 
about 36 to 94 monomers, usually from about 36 to 68 monomers and more usually 
from about 56 to 64 monomers; while in other embodiments from about 6 to 52 
monomers, usually from about 8 to 28 monomers and more usually from about 8 to 24 
monomers. The polymeric stretch or domain may be made up of any sequence of 
monomeric residues, where the residues may be naturally occurring or synthetic 
residues (i.e. non-naturally occurring residues) so long as a portion of the residues, 
generally the residues at either end of the polymeric domain, are capable of 
hybridizing to each other analogous to a Watson-Crick base pair type interaction. In 
other words, the polymeric domain that gives rise to the hairpin loop has two flanking 
regions that are complementary to each other, where the length of each complementary 
flanking region in terms of monomeric units ranges in length from about 1 to 24 units, 
usually from about 2 to 12 units and more usually from about 2 to 10 units. As such, 
the residues are typically nucleotides, either naturally occurring or synthetic. 
Separating the flanking residues that participate in the base pairing interaction 
described above to form the hairpin loop structure is a domain of residues that do not 
participate in hybridization but serve as a spacer group. Generally, the length of this 
spacer group in terms of monomeric units ranges from about 1 to 6 units, usually from 
about 2 to 5 units, and more usually from about 3 to 4 units. 

Specific bar codes of interest include: poly cytidy lie acid, poly adenylic acid, 
polyuridylic acid, polyA/polyC block copolymers, polyC/polyU block copolymers, 
polydT comprised of 5 to 150 monomers, random DNA oligonucleotides, abasic 
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deoxyribose-phosphate polymers, DNA or non-DNA polymers containing base-paired 
hairpins, DNA hairpins, polyacrylic acids, polyacetamides, polyacrylates, and 
polystyrenes. 

Stably associated with, and preferably covalently attached to, the molecular bar 
code of the targeted molecular bar code, is the second requisite moiety— the member of 
a specific binding pair. The specific binding pair member may be attached either 
directly to the molecular bar code or, preferably, through a linking group, more 
particularly a cleave-able linking group. By cleave-able linking group is meant that the 
linking group can be cleaved under conditions that do not disrupt, e.g. hydrolyze, the 
molecular bar code but serve to release the entire molecular bar code from the specific 
binding pair member of the targeted molecular bar code. Preferably, linkers are those 
that can be incorporated into the synthesis of the barcode, even where automated 
synthesis protocols are employed. The linking group may be a variety of different 
moieties, where suitable groups include: disulfide groups, restriction sites, 
photocleavable groups, and the like. Chemically cleavable linkers of interest are those 
that include disulfide groups, dimethoxytrityl groups, etc. Restriction sites of interest 
that may may be present in the linking groups include: EcoR I, Sma I, Kpn I, Dra I 
and the like. Photocleavable groups of interest include: vanillin based photolabile 
linkers, and the like. 

The specific binding pair member may be any moiety capable of specifically 
interacting with a second moiety, e.g. an analyte of interest. As such, the specific 
binding pair member of the targeted molecular bar code may be a ligand or receptor, 
an antibody or binding fragment thereof, e.g. Fv, F(ab )2 and Fab, an antigen 
recognized by a circulating antibody and the like; a nucleic acid, e.g. oligonucleotide, 
etc., where the nature of the specific binding pair member will depend on the nature of 
the analyte of interest, e.g. whether the analyte is a receptor or other cell surface 
antigen, a ligand, RNA, DNA, etc. In some cases the polymer portion of the targeted 
molecular bar code may serve as the second requisite moiety, i.e. the specific binding 
pair that binds with the analyte. 

The subject targeted molecular bar codes can be fabricated using any 
convenient protocol, where a variety of protocols are known to those of skill in the art. 
For example, the polymeric molecular bar code can be synthesized in whole or in 
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parts, where the parts are subsequently joined together. The resuhant polymeric 
molecular bar code can then be conjugated to the binding moiety to produce the 
targeted molecular bar code. The manner by which the targeted molecular bar code is 
produced is not critical to the invention. The particular method employed will depend, 
at least in part, on the nature of the targeted molecular bar code and its constituent 
parts. For example, for those components of the bar code that are nucleic acids, 
chemical synthesizers and/or recombinant techniques may be employed. For those 
components that are biologically inert polymers, e.g. polystyrenes, polyacrylates, etc., 
synthetic polymerization protocols can be employed, where of particular protocols of 
interest are living polymerizations that yield products of low polydispersity and 
controlled molecular weights, e.g. as reported in Benoit et al., J. Am. Chem. Soc. 
(1999) 121 : 3904. Representative procedures for producing targeted molecular bar 
codes is provided in the experimental section, infra. 

The subject targeted molecular bar codes find use in a variety of applications, 
particularly analyte detection applications in which a sample is assayed for the 
presence of an analyte(s) of interest. A wide variety of analytes may be detected using 
the subject targeted molecular bar codes, where such analytes include both naturally 
occurring and synthetic compounds, e.g. biological analytes, such as antibodies, 
receptors, ligands, peptides, proteins, nucleic acids (DNA, RNA), viruses, bacteria, 
toxins, etc.; and environmental analytes, e.g. toxins, pollutants, etc.; and the like. 

The sample that is screened in the subject methods may be obtained from a 
variety of sources. Samples, as used herein, include biological fluids such as blood, 
cerebrospinal fluid, tears, saliva, lymph, dialysis fluid, semen and the like; organ or 
tissue culture derived fluids; food and fluids extracted from cells or physiological 
tissues, where the cells may be dissociated, in the case of solid tissues, or tissue 
sections may be analyzed. Alternatively a lysate of the cells may be prepared. Other 
samples of interest include environmental samples, such as plant tissue samples, 
ground water samples, soil samples, and derivatives of such samples. Because of the 
nanoscale dimensions of the poer detector, the sample that is screened may be as small 
as a single biological cell (typically of 1 0 \xm diameter) or as small as a defined target 
within a biological cell such as organelles, vescicles etc. (typically of 1 |Lim diameter or 
less). 
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In performing the subject methods of analyte detection, the first step is to 
contact the sample suspected of comprising the analyte of interest with a targeted 
molecular bar code having a specific binding pair member specific for the analyte of 
interest. The term "contact" is used in the broadest sense to mean any type of 
5 combining action which brings the targeted molecular bar into sufficiently close 

proximity with the analyte of interest in the sample such that a binding interaction will 
occur if the analyte of interest specific for the specific binding pair member of the 
targeted bar code is present in the sample. Contacting can be achieved in a variety of 
different ways, including introducing the targeted molecular bar code into the sample, 
10 and the like. 

Following contact or introduction of the targeted molecular bar code into the 
sample, the resultant reaction mixture is incubated for a sufficient period time for any 
specific binding interactions, e.g. ligand-receptor binding, hybridization, etc., between 
the specific binding pair member and analyte (if present) in the sample to occur. The 

15 particular incubation conditions will vary depending on the specific nature of the 

analyte of interest and the specific binding pair member, where such conditions can be 
readily determined by those of skill in the art. For example, where the specific binding 
pair member is an antibody and the analyte of interest is an antigen, the incubation 
solution is generally buffered in the range of about pH 6.5-9.5. The incubation time 

20 should be sufficient for the targeted molecular bar code to bind available analyte 

molecules. Generally, from about 0.1 to 3 hr is sufficient, usually 1 hr sufficing. In an 
alternative example, where the analyte of interest is a particular DNA and the specific 
binding member is a DNA (or hybridizing mimetic thereof, e.g. PNA) having a 
sequence complementary to the sequence of the DNA of interest, conditions sufficient 

25 for hybridization are employed, where the conditions will generally be stringent 

conditions, for example, at 50°C or higher and O.lxSSC (15 mM sodium chloride/01.5 
mM sodium citrate). Incubation results in the formation of specific binding complexes 
between the specific binding pair component of the targeted molecular bar code and 
the analyte(s) of interest, if present, in the sample being assayed. 

30 Following complex formation, any unbound targeted molecular bar code is 

separated from the analyte/targeted molecular bar code complexes, i.e. the bound 
targeted molecular bar codes. Any convenient separation protocol may be employed, 
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where the particular separation protocol that is chosen in a given assay will depend 
primarily on the nature of the analyte. Thus, where the analyte is large, e.g. a cell, one 
means of separation includes centrifugation, with cell bound targeted molecular bar 
codes being separated from unbound targeted molecular bar codes that are free in 
solution. Alternatively, where the analyte of interest, if present, is fixed to a solid 
support, e.g. a polynucleotide blot on nitrocellulose (such as a Northern blot), the 
substrate may be separated from the sample and washed to remove unbound targeted 
molecular bar codes. In yet other embodiments, the separation protocol may involve 
size separation protocols, with the analyte bound targeted molecular bar codes being 
separated from the unbound targeted molecular bar codes by methods that discriminate 
and separate based on size, such as chromatography, gel electrophoresis etc. 

Following separation of the targeted molecular bar code/analyte complexes 
from the unbound or free targeted molecular bar codes, as well as any other sample 
constituents, the molecular bar code of the analyte/targeted molecular bar code 
complex is separated from the remainder of the complex, i.e. the specific binding pair 
member and the analyte. Separation generally is achieved through cleavage of the 
linker group. As such, separation typically involves treating the complex with an agent 
that cleaves the linking group. The agent that is chosen thus necessarily depends on the 
nature of the linking group. For example, where the linking group is a photocleave- 
able group, the agent may be light of appropriate wavelength to disrupt or cleave the 
photocleave-able group. In many embodiments, the wavelength of light used to cleave 
the photo-cleavable linking group ranges from about 180 nm to 400 nm, usually from 
about 250 nm to 400 nm and more usually from about 300 nm to 400nm. 
Alternatively, where the agent is a chemically cleave-able group, the agent will be a 
chemical agent capable of cleaving the group, .e.g. dithiothreitol or beta- 
mercaptoethanol where the linking group is a disulfide. In yet other embodiments 
where the linking group is a restriction site, the agent is a catalytic agent, e.g. a 
restriction enzyme. Importantly, this cleavage step should not involve the use of agents 
which disrupt the nature of the molecular bar code, e.g. that hydrolyze it. This 
cleavage step results in the production of a cleavage product mixture comprising free 
molecular bar code and analyte/specific binding member complex. 
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Following the production of the cleavage product mixture, the free molecular 
bar codes present therein, if any, are detected and related to the presence of the 
analyte(s) of interest in the sample. Detection may or may not involve separation of 
the free molecular bar code from the remainder of the cleavage product mixture, as 
5 may be convenient or desired. The free molecular bar code is detected using any 
convenient protocol. Thus, in an embodiment where the bar code is capable of 
participating in Watson-Crick base pair interactions, the bar code may be detected with 
a complementary labeled nucleic acid, e.g. radioactive or fluorescently labeled RNA, 
where this detection step may or may not involve an amplification step, e.g. a PCR 



A particularly preferred means of detecting the molecular bar code is to 
translocate the molecular bar code through a nanopore under the influence of an 
applied electric field and observe the effect over time of the translocation on a 
measurable signal. One such measurable signal is blockade of ion current through a 

1 5 nanopore which generates a unique current blockade profile for each molecular bar 

code. In other words, the molecular bar code is "scanned" by translocating it through a 
nanopore, where the output of the scan is the current blockade profile. Once the 
current blockade profile is obtained, it can be compared to reference current blockade 
profiles to determine the exact identity of the specific binding member to which the 

20 molecular bar code was originally bound in the initial targeted molecular bar code. 
This comparing step and identification step can be done manually but is ideally 
performed by an appropriate computer hardware/software system. The above process 
is summarized in Figure 4. 



25 is typically a device that comprises a nanopore inserted into a thin film with means for 
applying an electric field across the nanopore and for measuring the resultant signal at 
the nanopore. See Figure 2. By nanopore is meant a structure having a channel or pore 
with a diameter of "nano" dimensions, where the inner diameter of the pore or channel 
typically ranges from about Ito 10, usually from about 1 to 5 and more usually from 

30 about 1 to 2 nm. The nanopore may be synthetic or naturally occurring, where 

naturally occurring nanopores include oligomeric protein channels, such as porins, 
gramicidins, and synthetic peptides and the like, where a particularly preferred protein 
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Step. 



The nanopore device that is employed in the above preferred detection method 
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channel is the self-assembled heptameric channel of a -hemolysin. In one 
embodiment, the thin film into which the nanopore is inserted is a lipid bilayer 
fabricated from a wide variety of one or more different lipids, where suitable lipids 
include: phosphatidly choline, phosphatidylserine, phosphatidylethanolamine, glycerol 
mono-oleate, and cholesterol. 

A variety of suitable thin film support devices have been reported in the 
literature that may be used to support the nanopore used to detect the molecular bar 
code. Such devices include those described in: Brutyan et al., Biochimica et 
Biophysica Acta (1995) 1236:339-344; Wonderlin et al., Biophys. J. (1990) 58:289- 
297; Suarez-Isla et al. Biochemistry (1983) 22:2319-2323 as well as those disclosed 
and reviewed in U.S. Patent Application Serial No. 08/405,735 entitled 
"Characterization of Individual Polymer Molecules Based on Monomer-Interface 
Interactions" filed on March 17, 1995 and having a University of California reference 
number of 91-287-2. Of particular interest is the device described in co-pending U.S. 
Patent Application No. 60/107,307 filed November 6, 1998, the disclosure of which is 
herein incorporated by reference. 

In detecting or "scanning" the molecular bar code with a nanopore device, the 
first step is to place the molecular bar code to be detected, e.g. the released or free 
molecular bar code generated from the cleavage step described above, on the cis side 
of the nanopore. The free molecular bar code will generally be in an aqueous solution, 
e.g. a buffered solution, where the solution typically comprises one or more dissolved 
salts, such as potassium chloride and the like, and the pH ranges from about 6.0 to 9.0, 
and more usually from about 7.0 to 8.5. The solution on the trans side of the nanopore 
may be the same or different from the solution on the cis side, but will also generally 
be an ionic buffered solution. After the free molecular bar code is placed on the cis 
side of the pore, an electric field is applied across the pore, conveniently by electrodes 
positioned in the cis and trans side of the pore. The electric field that is applied is 
sufficient to translocate the free molecular bar codes through the nanopore, and may 
range from about 50,000 to 500,000 volts per cm, where the applied electric field will 
typically range from about 100,000 to 400,000 volts per cm and more usually from 
about 1 50,000 to 300,000 volts per cm. During translocation of the molecular bar 
code through the nanopore, the ion current through the pore is measured. 
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Measurements of individual molecular bar codes will typically be made at least every 
1 s, usually at least every 0.1 s and more usually at least every 0.02 s using a single 
nanopore. The measured data values are then manipulated to produce a current 
blockade profile or similar output capable of being compared against reference outputs 
such that the identity of the molecular bar code can be determined. The presence of 
analyte (if any) in the sample is then related to the quantity of its specific molecular 
bar code detected that was detected by the bar code. 

Because of the specific nature of the subject analyte detection assays 
employing the subject targeted molecular bar codes, amplification of the analyte or 
label associated therewith need not be performed prior to detection. Thus, the subject 
methods may be used to detect analyte in a sample at very low levels, where the 
amount may be as low as 20 picomole or lower but will at least be 1 zeptomole, and in 
certain embodiments at least 1 attomole. In addition, the presence of the analyte can be 
detected rapidly, usually in less than about 2 hours, and more usually in less than about 
1 hour, where the time required for analyte detection may be as short as 1 minute or 
shorter, but will be at least 5 minutes in many embodiments. 

The above analyte detection methods find use in a variety of different 
applications in which the detection of one or more analytes of interest is desired. 
Applications in which the subject methods find use include diagnostic and screening 
applications. Diagnostic applications include applications in which the detection of 
one or more specific analytes in a complex physiological mixture, such as the samples 
described above, is desired. In such applications, the presence of the analyte(s) of 
interest will generally be indicative of a particular disease or condition in the host from 
which the screened sample is derived. Thus, in diagnostic applications according to the 
subject invention, the presence of one or more analytes of interest is detected in a 
sample from the subject being diagnosed using the methods described above and then 
related to the presence or absence of a disease or condition. 

In screening applications based on the subject methods, the subject targeted 
molecular bar codes are targeted, i.e. have a specific binding pair member that binds to 
an analyte of interest such as a toxin, bacterium, virion, pollutant and the like. As such, 
the screening applications can be used to screen physiological or environmental 
samples for the presence of man-made or naturally occurring toxins, pollutants, etc. 
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Also provided by the subject invention are kits comprising the subject targeted 
molecular bar codes. The number of different types of targeted bar codes in the kit 
may range widely depending on the intended use of the kit, where the number may 
range from 1 to 500,000 or higher, where any two targeted bar codes are considered to 
5 be of different type if their molecular bar code and specific binding pair member 

components are different. Thus, kits according to the subject invention may comprise a 
single type of targeted molecular bar code or 2 or more different types of targeted 
molecular bar codes, including sets of 4 different types of targeted molecular bar 
codes, sets of 10 different types of targeted molecular bar codes, sets of 64 different 

10 types of targeted molecular bar codes, sets of 100 different types of targeted molecular 
bar codes, sets of 1,000 different types of targeted molecular bar codes, sets of 
100,000 different types of targeted molecular bar codes and sets of 500,000 types of 
targeted molecular bar codes. Thus, of interest are kits with at least 5, usually at least 
10, more usually at least 15 different molecular bar codes, where the number of bar 

15 codes in these kits may be as high as 20, 100, 200, 500, 1000, 5000, 10,000, 20,0000 
or more. In addition to the targeted molecular bar codes, the kits may further comprise 
one or more additional assay components, such as suitable buffer media, and the like. 
The kits may also include a device for detecting and scanning or reading the bar codes, 
such as those described supra, where the devices disclosed in copending U.S. Patent 

20 Application serial no. 60/107,307 filed on November 6, 1998, are of particular interest. 
Finally, the kits may comprise instructions for using the targeted bar codes in methods 
of analyte detection according to the subject invention, where these instructions for use 
may be present on the kit packaging and/or on a package insert. 

25 The following examples are offered by way of illustration and not by way of 

limitation. 

EXPERIMENTAL 

I. Preparation of a Horizontal Bilayer Containing a Single Nanopore Using a 
30 Miniature Horizontal Support 
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A single channel was inserted into a bilayer on the horizontal aperture as 
follows: 

A. Formation of diphytanoyl PC/hexadecane bilayers on a horizontal aperture. 

A miniature support was manufactured as described under 'Preparation of 
Subject Devices' as disclosed in copending application serial no. 60/107,307 filed 
November 8, 1998, the disclosure of which is herein incorporated by reference. A lipid 
bilayer was then formed as follows. The aperture and the Teflon bath holding the 
aperture were first cleaned for 10 min in boiling 5% nitric acid, then rinsed in 
nanopure water. Just before use, the aperture and bath were rinsed with ethanol 
followed by hexane, and then air dried. The aperture was then coated with a thin film 
of diphytanoyl PC (obtained from Avanti Polar Lipids, Birmingham, Alabama) by 
applying 5 |il of a 200 jig per mh solution in spectroscopy grade hexane which was 
then evaporated with a light stream of air injected through the U-tube from the trans 
side. The chambers on both sides of the aperture were then filled with 65 |al of buffer 
composed of 1 .0 M KCl, 5 mM HEPES/KOH at pH 7.5. Silver chloride electrodes 
manufactured using standard methods were placed directly into each bath and were 
attached to an Axopatch 200B amplifier. To paint a bilayer, a single one-centimeter- 
long bristle on a 000 brush was dipped into a 3 mg per mh diphytanoyl PC solution in 
spectroscopy grade hexadecane. The bristle was then gently brushed across the 
aperture as viewed by a standard dissecting microscope. A 5 mV, 60 cycle square 
wave was applied across the aperture as a seal test. Once a seal was achieved the 
aperture was brushed repeatedly with a clean bristle until a capacitance of about 0.6 
|j,F cm" was achieved. 

B. Insertion of individual a-hemolysin channels into the 
DiphytanoylPC/hexadecane bilayer. 
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a-hemolysin lyophilized in phosphate buffer (Calbiochem, La Jolla, CA) was 
dissolved in nanopure water at 2 |j.g per |il and dispensed as 2 \x\ aliquots into 0.2 mh 
polypropylene tubes. These aliquots were frozen at -20^ C. On the day of an 
experiment, a single tube of toxin was placed on ice and diluted in 1 .0 M KCl/HEPES 
buffer to a final concentration of 0.04 |j.g per |j.l. One ]li1 of this diluted stock was added 
to the cis side of the bilayer and mixed gently. Voltage (120 mV trans positive) was 
then applied across the bilayer. A single channel typically inserted into the bilayer 
within 10-60 minutes as indicated by an abrupt increase in current. In the event that 
no channel insertion was observed in one hour, a second 0.04 jug aliquot of toxin was 
added. This generally resulted in a channel within an additional 1 5 minutes. Upon 
channel insertion, the cis chamber was immediately perfused with 2 mL of buffer, i.e. 
about 30 times the bath volume. Single rectifying channels (ca. 120 pA current in IM 
KCl at 120 mV potential trans positive) were used immediately for molecular bar code 
analysis. 

II. Use of the Device 

The nanopore device described above was used to characterize molecular bar 
codes as follows: 

A. Preparation of Molecular Bar Codes 

1 . Preparation of RNA Homopolymer Molecular Bar Codes 

Homopolymers of polycytidylic acid and polyadenylic acid (2000+ nt) were 
purchased from Fluka (Ronkonkoma, NY). To generate shorter fragments, 5 mg of 
full length RNA homopolymers were weighed into a 12 mL polypropylene tube. To 
this was added 1 mL of alkaline buffer (pH 10.2, 40 mM NaHC03, 60 mM Na2C03) 

pre-warmed to 60*^ C. For a product ranging in size from 100 to 500 nt in length, the 
solution was incubated at 60*^ C for 23.5 minutes and the reaction stopped by adding 
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100 ^1 of 3 M sodium acetate, pH 5.2, and 50 |al of 10% glacial acetic acid. The RNA 
was precipitated in 2.5 volumes of ethanol at -20^ C. The pellet was rinsed in 80% 
ethanol, then redissolved in 1 volume water and 1 volume 2X formamide loading 
buffer (90% formamide, 10% lOx MOPS RNA buffer). The product was loaded on an 
8% polyacrylamide/MOPS gel and run at 4 volts per cm alongside RNA markers 
(Century Markers, Ambion Inc., Austin, TX). The gel was then examined by UV 
shadowing and RNA fragments of varying length were excised and eluted from the gel 
by electrophoresis. The sized RNA was then ethanol-precipitated and redissolved in 
water or pH 7.5 TE buffer at 2-to-5 \xg per |al. 

2. Synthesis of a Molecular Bar Code Comprised of an RNA Block Copolymer 

A 134 base DNA oligo-nucleotide composed of the sequence 
TAATACGACTCACTATAGGGA(A29)/C(7o)GGTACCACACAC (SEQ ID NO:01) 
was purchased from Midland Certified Reagents (Midland, TX), Full-length 134 nt 



strands were separated from incomplete strands by electrophoresis on an 8% 
preparative PAGE/TBE gel at 100 V for 4 hours. The desired band was excised, the 
full length material was electroeluted from the gel slice, precipitated in ethanol, rinsed 
twice with 80% ethanol, air-dried, then dissolved in water to give a final concentration 
of 1 \xg per |LiL 

Double-stranded template was synthesized from the purified single-stranded 

134mer using Sequenase (Amersham/U.S. Biochemical, Cleveland, OH). Briefly, 1 

\xg of the 134mer (25 pmol final) were combined with 0.2 |Lig of a 14 base reverse 

complement to the 3' end of the 1 34 mer (50 pmol final), 4 \il of Sequenase 5X 

buffer, and 3 \il nanopure water. This mixture was heated to 65^ C for 2 minutes and 

gradually cooled to 4*^ over 30 minutes to permit annealing of the reverse complement 

to the 134 nt strand. This solution was then heated to 37^ for two minutes and 
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combined with 1 |al 0.1 M DTT, 2.4 |al of a 2.5 wM dNTP mixture at room 
temperature, and 6 of pure water. This solution was brought to 37*^ C for 1 minute, 
combined with 1 jil of 13U per \il Sequenase and then incubated at 37^ C for 45 
minutes. The resulting double-stranded DNA product was stored at -20^ C. 

RNA was synthesized using the 134nt double-stranded DNA template and a T7 
RNA polymerase-based kit designed to give very high yields of short transcripts 
(Megashortscript, Ambion Inc., Austin, TX). Briefly, we combined, in order, at room 
temperature, 4 |al nanopure water, 2 |al lOX transcription buffer, 2 \il each of 75 mM 
ATP, CTP, UTP, GTP, 4 |al of dsDNA template from the previous step, and 2 |al 
Megashortscript(T7 RNA polymerase) enzyme stock. This mixture was incubated at 
37^ C for 2 hours. At the end of the incubation, 1 \il of 2U/|il DNAse 1 was added 
along with 0.25 jag of RNAse Tl (Life Technologies) to cleave undesired ends of the 
RNA product at G residues. This digestion was incubated at 37^ for 15 minutes. The 
product was then run on an 8% PAGE gel in 1 x MOPS RNA buffer af 80 V. The 
desired 101 nt band was excised and eluted by electrophoresis. The elution buffer was 
then exchanged for pH 7.5 TE buffer using a Bio-Rad 30 spin column (Hercules, CA). 
The final product was stored at 2 jag per ^1 in a -20^ C freezer. 

3. Synthesis of Molecular Bar Codes Using a DNA Synthesizer 

Molecular bar codes were synthesized by conventional phosphoramidite 
chemistry using an automated device. The coupling and deprotection steps were in 
accordance with standard methods recommended by the manufacturer of the DNA 
synthesizer. The synthesis scale ranged from 40 nmol to 1 |amol, and the column 
support was typically dT-CPG with a pore size of 500 angstroms to 1000 angstroms. 
These syntheses were typically performed 'trityl on' after the last coupling step, 
permitting purification of full length molecular bar codes using a widely practiced 
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column chromatography procedure. An example molecular bar code synthesized 
using 69 coupling reactions at 0.2 umol scale was: 



PolydT(33) /PEG(i8) /dT(,)/PEG(i8)/polydT(34) 
A current blockade profile for this molecular bar code is illustrated in Figure 8. 



In one embodiment, an automated DNA synthesizer is used to directly couple a 
bar code to a DNA oligomer that is a reverse complement to a targeted gene ( e.g. 
Xenopus elongation factor). A cleave-able site (the Xba I recognition sequence) is 
built into the synthetic strand between the bar code and the specific binding pair 
member using a hairpin and a C18 spacer. The complete targeted molecular bar code is 



-^paGer-^8/GTGTGGAA-TGAGAAeGA-TGTT-&^^^ 

This molecule is applied to a membrane blot that is thought to contain Xef mRNA. 
Following equilibration, unbound targeted molecular bar code is washed from the 
surface. The bound material is then digested with Xba I which cuts the strand at the 
Xba I recognition sequence in the hairpin stem, thus releasing the 
5nitroindole(20)/Spacer9/dT(l)/Spacer9/dT(17) molecular bar code for analysis using 
a nanopore. 

B. Current Blockades Produced by Molecular Bar Codes 

1 . Single channel current recordings. 

Current readings across single a-hemolysin channels were acquired using an 
Axopatch 200B integrating patch clamp amplifier (Axon Instruments, Foster City, 
CA) in voltage clamp mode. Unless otherwise noted, data were acquired at 1 0 |j,s 
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intervals in the whole cell configuration and were filtered at 10 kHz using a low-pass 
bessel filter. The analog signal was digitized using an Axon Instruments Digidata 
1200 Series Interface, and then stored using pClamp 6.02 software (Axon Instruments, 
Foster City, CA). Before the addition of molecular bar codes to the cis chamber, data 
5 were acquired in gap free format for 15 seconds each at 0, +120m V, and -120 mV. 
Molecular bar codes (10 -15 |j.g unless otherwise noted) were added to the cis 
chamber, and blockades of current were examined at 120 mV (trans positive) for five 
minutes. Blockades were stored in pClamp 6.02 using the event driven format. 

2. Quantitative discrimination between RNA molecular bar codes using a 
1 0 nanopore 

Blockades of ionic current caused by occupancy of the a-hemolysin pore by 
the polyadenylic acid (polyA RNA) bar code were measured first. The polyA 
blockades (Fig. 7a) fell into three populations: i) relatively short (<200 |lis) blockades 
that reduced the current by 40-60 pA; ii) blockades of indeterminate length that 

15 reduced the channel current by 65 pA to a residual current of about 50 pA (55% 
blockades); and iii) blockades of 1 .5 to 2.5 ms that reduced the channel current by 
about 98 pA to 19 pA of residual current (84% blockades). The duration of the third 
class of blockades was strand-length dependent, whereas the duration of the first two 
classes was length independent (data not shown). The pattern of blockades caused by 

20 polyC bar codes was easily distinguishable from the pattern for polyA bar codes 
whether the polymers were examined separately (Figs 7a and 7b) or in combination 
(Fig 7c). That is, the channel current was reduced significantly more by the polyC 
RNA bar code (typically 95% blockades) than by the polyA RNA bar code (84% 
blockades) and the polyC blockades were shorter in duration, averaging 6 [is per 

25 nucleotide compared to 16 |is per nucleotide for polyA. Also, polyC RNA rarely 

induced lower amplitude blockades or biphasic blockades that were very common with 
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polyA RNA. This constitutes proof of principle that the nanopore can quantitatively 
discriminate between a mixture of molecular bar codes in an aqueous solution. 

3. Detection of blocks within individual RNA molecular bar codes 

The preceding experiment suggested that a transition from polyA to polyC 
blocks within individual bar codes should be detectable by the a-hemolysin pore. To 
test this prediction, in vitro transcription and ribonuclease Tl digestion was used to 
generate a 101-nucleotide-long RNA bar code of the nominal composition 
A(30)C(70)Gp (section IIA.2 above). Typical biphasic blockades caused by 
occupancy of the channel by this bar code are shown in Figure 3. As predicted, one 
component of the blockade reduced the channel current by 95% (consistent with the 
polyC block of the bar code), and the other component reduced the current by 84 % 
(consistent with the polyA block of the bar code). This proves the principle that 
blocks can be read along single molecular bar codes using a nanopore. 

4. Establishment of a unique current blockade profile for a molecular bar code 
comprised of DNA and non-DNA blocks 

A molecular bar code of the composition: 

polydT(33) /PEG(i8) /dT(i) /PEG(i8) /polydT(34) 

was synthesized on an ABI Automated DNA synthesizer using conventional 
phosphoramidite chemistry as outlined in Section ILA.3 above. This molecule was 
then examined using a nanopore as described in Section II.B.l, revealing a unique 
current blockade profile (Figure 8) that was readily distinguishable from the other 
molecular bar codes listed in this application 
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III. 



Detection of N-ras in a Complex Sample 



A. 



Synthesis of an N-ras targeted molecular bar code and its use for specifically 



detecting a segment of N-ras Exon 1. 

A targeted molecular bar code was synthesized at 1 |imol scale on an ABI 392 
DNA Synthesizer using standard conditions. The targeted bar code (top of Figure 9) 
was composed of a poly dTis mer at its 5' end, linked by a disulfide bond (Glen 
Research, Sterling, VA) to a 50 base-long, anti-sense segment of N-ras Exon 1. This 
targeted bar code (10 i^M final concentration) was hybridized at 40 degrees C for four 
hours in 6x SSC to either a 40 base segment of N-ras Exon 1 (sense strand) or to a 70 
base segment of the p53 open reading frame. Each of the target oligonucleotides was 
attached to polystyrene beds to facilitate recovery and washing of the bound bar code. 
Following hybridization, the beads were washed twice in 0.2 x SSC at room 
temperature. The beads were then suspended in 20 fil of 0.1 M DTT (pH 8.3/Tris) for 
30 minutes to cleave the disulfide linkage between the dTig code and the N-ras 
targeting sequence. The beads were subsequently spun down and the dTig bar code 
was recovered in the supernatant. One half the supernatant was then added to ca. 1 00 
|il of IM KCl buffer bathing a single alpha hemolysin nanopore. The recovered bar 
codes were then counted by translocating them through the pore at 1 50 mV applied 
potential and identifying dTig-specific channel blockades. A diagram of the above 
protocol is provided in Fig. 9. 

B. N-ras specific bar codes detected by the alpha hemolysin nanopore. 

Fig. 10 provides the results following detection of the bar codes obtained from 
A above. Panel A of Fig, 10 shows the unimpeded current through a single alpha 
hemolysin channel at 150 mV in IM KCl (HEPES/pH 8.0). Panel B shows the current 
through the same charmel following addition of bar codes that had been cleaved from 
the N-ras antisense strand hybridized to N-ras target beads. Panel C depicts an 
expanded view of a blockade pattern specific to the dTig code cleaved from the N-ras 
targeting sequence. Only blockade events with this specific pattern were counted as 
positive results. These results are presented in the following Table 1 . 
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Table 1. 



Quantitative measure of N-ras targeted bar codes hybridized to a gene specific target 

and to a negative control. 

5 



D 



Bar Code Targeting Sequence 


Target Sequence 


N-ras Specific 

Bar Code Signatures 

(Counts per Assay + S.E.) 


N-ras a-sense 
50mer 


N-ras sense oligo 

Attached to polystyrene bead 


50+ 14 


N-ras a-sense 
50mer 


p53 sense oligo 
Attached to polystyrene 
bead 


<1 



The above results demonstrate that a molecular bar code can be specifically 
targeted to a macromolecule of interest, then cleaved from the targeting moiety at a 
10 linker, and finally quantified using a nanopore detector, 

IV. Synthesis of Molecular Bar Codes with Hairpin Structures 

A. Bar Code Synthesis. 

15 DNA oligonucleotide bar codes were synthesized by conventional 

phosphoramidite chemistry using an ABI 392 DNA Synthesizer. Each molecule was 
composed of 20 dC monomers at the 5' end, followed by a hairpin with a 4 dT turn 
and a 0-10 bp stem, and then by 20 additional dC monomers at the 3' end. As controls, 
oligonucleotides with the identical chemical compositions but scrambled sequences 

20 were also synthesized. These oligonucleotides were purified by PAGE and stored in 
TE buffer at -20 degrees centigrade. 

B. Characterization 

The effect of hairpin structure on oligonucleotide transport through a model 
25 nanopore was tested by adding purified preparations to a 70 |il bath containing 1 M 
KCl buffered at pH 8.0 (HEPES/KOH) and at room temperature. 80 to 200 mV 
potential was applied across a diphytanoyl phosphatidylcholine bi layer containing a 
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single a-hemolysin nanopore. This applied voltage permits capture and translocation 
of the anionic DNA strands by the nanopore which can be read as an abrupt decrease 
in monovalent ion current. The duration of the current blockade is an accurate measure 
of the time required for an individual strand to translocate across the nanopore. 
Typical blockades are shown in Figure 1 1 . 

Figure 12 shows that including hairpins with 2-to-lO base pair stems in DNA 
oligomers increases the blockade duration by 15-18 lis per base pair relative to 
randomized controls. A two-base pair difference is easily distinguishable. 
Furthermore, oligomers containing different number of base pairs but the same 
number of monomer units can be distinguished based on blockade duration alone 
given a sample size of one zeptomole (1000 molecules) (Figure 13). Thus, 
introduction of a hairpin into a synthetic DNA strand provides an unambiguous means 
to distinguish between two 'bar code' populations at high sensitivity based on residence 
time in the nanopore. 

The base pairing structure also acts as a built-in pause to provide a better 
signal-to-noise ratio for distinct portions of an encoded polymer. For example, a 
section of abasic units in a molecular bar code passing through a nanopore should give 
a transient increase in current because the smaller size of the units will block the 
current less. But, the speed at which these highly charged units pass through the 
channel makes them unobservable. To pause the molecular bar code and make the 
observation possible, a 6 base-pair stem was inserted into an oligonucleotide before 
and after a 20 monomer-long abasic segment (Figure 14). This resulted in an easily 
observable current transition during occupancy of the pore by the abasic segment of 
the encoded polymer (Figure 15). Specifically, when this molecule was translocated 
through the a-hemolysin nanopore under a 120 mV applied potential, a three part 
blockade signature was clearly evident in which the first blockade component was 
characteristic of poly dC (14 pA or 89% current reduction relative to the open 
channel), the second component was characteristic of a deoxyribose-phosphate 
polymer (43 pA or a 66% current reduction) and the last component was characteristic 
of random DNA sequence (13 pA or a 90% current reduction) (Figure lb). This 
experiment demonstrates that a synthetic polymer can be read by a nanopore, and that 
multiple component signals are detectable. 
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The above results demonstrate that inclusion of hairpin loop structures into 
molecular bar codes modulates the characteristics of the bar codes. These built-in 
secondary structures allow better control over the rate of polymer translocation 
through the nanopore. That is, encoded polymers made from extended, single-stranded 
DNA traverse the a-hemolysin nanopore at a rate of 1 - 3 jis/nucleotide. The above 
demonstrates that including hairpin structures with 2-to-lO base pair long stems 
increases the traversal time by 15-18 |as per base pair. This temporal control is 
important for two reasons: 1) slowing traversal permits a longer read time for each 
coding segment in the pore, which results in a much better signal to noise ratio and 
higher fidelity; and 2) building hairpins into the polymers allows one to use the 
traversal time as part of the code. For example, we have shown that oligomers 
containing different numbers of base pairs but the same number of nucleotides can be 
distinguished from one-another with a sample size of 1000 molecules (one zeptomole). 
The above described secondary structures may be incorporated into any type of bar 
code to provide for modulated properties, where such bar codes also include those 
described in V supra, e.g. e.g. polyacrylic acid or polystyrene containing bar codes. In 
addition, non-DNA secondary structures could be built into the bar codes and to 
regulate translocation in like manner. 

V. Preparation of Synthetic Polymers for Use in Molecular Bar Codes via 
Nitroxide-Mediated Radical Polymerization 

Nitroxide-mediated radical polymerizations were used to prepare synthetic 
polymers, where the protocols employed are based on the 'BraslauA^ladimir Initiator.' 
See the relevant literature section, supra. This technique is used to effect "living" 
polymerizations with extremely low polydispersities and controlled molecular weights 
for a number of monomer families, including styrenes, acrylates, acrylic acids, and . 
acrylonitriles (Figure 16). Using the Braslau/Vladimir initiator, controlled polymers 
(including block copolymers) bearing a variety of functionalities, including carboxylic 
acids, epoxides, amines, perfluoroalkyl groups, ethers, esters, amides, nitriles and 
substituted aromatic groups, are prepared. In addition, polymers with more complex 
topologies, such as comb, star and dendritic structures, are prepared. As such, this 
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method enables the production of a large number of unique bar codes that can be 
utilized in targeted molecular bar code structures. In other words, these preparation 
methods permit synthesis of molecular bar codes with a large selection of 
interchangeable polymer blocks. Furthermore, molecular bar codes with backbones 
inert in biological media can be synthesized, 

A representative protocol is as follows. Polymer bar codes are prepared using 
nitroxide-mediated living free radical polymerization with an alphabet of 
commercially available monomers. Both acrylates and styrenes have been shown to 
function well using the BraslauA^ladimir initiator. Initial bar codes are designed using 
these monomer classes as substrates. Water solubility is designed into the polymers 
by the use of hydrophilic aery late or methacrylate derivatives to form neutral 
hydrophilic backbones (Figure 17). Charged monomers are mixed into these 
backbones as either bulk random copolymers, gradient copolymers (achieved by 
syringe pump addition) or block copolymers (Figure 18). A few examples of each 
monomer class are shown here, all of which are commercially available (Figure 19). 

Polymerizations are typically carried out at 120 with 0.05 equivalents of the 
BraslauA'^ladimir initiator and 50 to 200 equivalents of monomer. We have previously 
demonstrated random copolymerization using w-butyl acrylate and a variety of 
functionalized acrylate monomers (200 equivalents): good polydispersities (<1.2) are 
typically obtained with ratios of up to 80/20 /-butyl acrylate/copolymer. Vinyl 
phosphonates are also of interest. Block copolymers starting with acrylates followed 
by styrene have given good results using our initiator, Acrylate/acrylate block 
copolymers are of interest as well. 

Derivatizable monomers can also be introduced into the polymer bar codes 
(Figure 20). These can be protected forms of ionizable species. For example, r-butyl 
acrylate can be used as a protected form of acrylic acid: treatment of the polymer with 
TFA should liberate the carboxylic acids. Vinyl phosphonate esters can serve as a 
protected form of phosphonic acids. Alternatively, these derivatizable monomers can 
be used as handles for the introduction of additional charged or fluorescent moieties; 
these can be added after the polymerization has been effected. In addition, the use of 
monomers with reactive functionality offers the opportunity to introduce graphed arms 
onto the polymer chain. This can be used to manipulate the effective diameter of the 
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polymer chain, or to design more complex structures such as branched or comb 
polymers (Figure 21). The introduction of lipophilic and/or sterically bulky residues 
into the bar codes (pre or post-polymerization) provides further modulation of the bar 
code profile. 

After polymerization, the bar codes can be designed to be attached to the targeting 
peptides at either the beginning or terminal end. Hawker's route (Figure 22) can be 
used to prepare an initiator with a pendant primary hydroxy group, or reductive 
cleavage of the "living end" of the polymer with Zn/HOAc, Na/Napth or Sml2 can be 
used to leave a secondary hydroxy group on the bar code terminus (Figure 23). 

The above results demonstrate that a large number of diverse polymers that can be 
used as targeted molecular bar codes may be synthesized using nitroxide mediated 
polymerization protocols. 

VL Preparation of Labile Linkers 

Photolabile Linker : 

Photolabile linkers based on vanillin are synthesized as follows. Treatment of 
vanillin 1 with K2CO3, provides alkylation of the phenol with 6-Bromohexanoic acid, 
yielding 2. 2 is then nitrated with fuming HNO3 and acetic acid to yield 3. 3 may be 
protected at the carboxylic acid with TMS depending on the stability of the final 
product. This step may or may not be necessary. 4 will be reduced to 5 with NaBH4. 
At which point 5 will be phosphitilated to the corresponding phosphoramidite 6. 6 
Can be used in a synthesizer to couple the linker to an oligo, or in solution phase 
coupling. Below, a proposed synthesis is presented in scheme 1 . 
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Standard peptide coupling using BOP reagent to activate the carboxyl group, 
will allow the peptide to be attached under mild conditions. To cleave and release the 
bar code at the appropriate time, UV light at 365-400 nm will be illuminated on the 
compound. The reaction scheme is presented below in scheme 2 



Scheme 2 




The proposed mechanism for photolysis of o-nitrobenzyl carbonate ester is present in 
the literature, but no specific mechanism for the photolysis of the corresponding 
phosphodiester is presented. Therefore, in scheme 3, the mechanism for a carbonate 
ester is shown. The mechanism for the phosphodiester should resemble the carbonate. 
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Dimethoxytrityl (DMT) is a common protecting group for the use of solid 
5 phase synthesis of DNA and RNA. Dimethoxytrityl is generally used to selectively 
protect primary alcohols. It is easily cleaved under mildly acidic conditions, but stable 
to basic work-up. A dimethoxytrityl group is incorporated into the cleavable linker for 
conjugation of an oligo to a protein as follows. The linker has a carboxylic acid 
functional group side chain to couple with an amine functionality 10. Scheme 4 



10 describes the synthesis of this linker. 




Scheme 4 



6-Bromohexanoic acid may be protected, where desired, prior to coupling with 
the phenol. If it is protected with TMS, it yields 7 the silyl ester of 6-bromohexanoic 

acid. 7 then alkylates p-hydroxy methylbenzoate to yield 8. 7 behaves like the methyl 
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ester during Grignard reaction. If alkylation at the silyl ester occurs, it is necessary to 
reverse the addition order and form the tertiary alcohol before alkylating with the 6- 
Bromohexanoic acid. The next step involves a double Grignard reaction at the ester , 
yielding alcohol 9. If the TMS protection of the caboxylate is not necessary, three 
5 equivalents of the Grignard will be necessary. The tertiary alcohol is then substituted 
with CI using acetyl-Cl, converting the compound to the linker/protecting group 10. 
This halide is then coupled to a primary alcohol using the same procedure as DMT 
protection. Once coupled to an oligo, the DMT linker is then attached to an amine 
functionality using standard peptide coupling techniques. Treatment with 
10 trichloroacetic acid is the standard deprotection method for DMT. The linker is more 
labile than the DMT because it has three electron donating groups vs. the two on 
DMT, so a milder acid treatment cleaves it readily. The general scheme is listed 
below. 




^ PROTEIN 



OMe 




HO- | Bar Code 



It is evident from the above results and discussion that the subject targeted 
20 molecular bar codes constitute a significant advance in the field of analyte detection. 
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The subject bar codes can be read directly without amplification and labeling steps. 
Furthermore, the bar codes can be read rapidly in much shorter times than that 
required for methods in which other labels, such as fluorescent or radioactive labels, 
are employed. In addition, the labels can be read from a heterogeneous population and 
need not be isolated before being read. 

All publications and patent applications cited in this specification are herein 
incorporated by reference as if each individual publication or patent application were 
specifically and individually indicated to be incorporated by reference. The citation of 
any publication is for its disclosure prior to the filing date and should not be construed 
as an admission that the present invention is not entitled to antedate such publication 
by virtue of prior invention. 

Although the foregoing invention has been described in some detail by way of 
illustration and example for purposes of clarity of understanding, it is readily apparent 
to those of ordinary skill in the art in light of the teachings of this invention that certain 
changes and modifications may be made thereto without departing from the spirit or 
scope of the appended claims. 
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